Rank | Count | Beginning |
---|---|---|
91436 | 6879 | ეს |
17348 | 5368 | ამ |
164445 | 3827 | მე |
71686 | 3409 | და |
217102 | 3351 | როგორც |
115584 | 3243 | თუ |
206370 | 2372 | რა |
149671 | 2330 | მაგრამ |
177742 | 2108 | მისი |
33352 | 1963 | არ |
129897 | 1948 | ის |
212404 | 1948 | რაც |
235135 | 1843 | საქართველოს |
23332 | 1770 | ამის |
278800 | 1756 | ჩვენ |
118045 | 1420 | თუმცა |
50789 | 1331 | ახლა |
285550 | 1253 | ძალიან |
40008 | 1182 | ასე |
118043 | 1177 | თუმცა, |
160335 | 1157 | მას |
83792 | 1140 | დღეს |
276854 | 1114 | ჩემი |
89161 | 1110 | ერთი |
152955 | 1091 | მათ |
156489 | 986 | მან |
123967 | 968 | იგი |
287879 | 944 | ძიება |
169756 | 919 | მერე |
246100 | 881 | სწორედ |
The next four subsections show the most frequent sentence beginnings consisting of N words, for N=1, 2, 3, 4. This subsection starts with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
For N=1 in particular, even a small corpus is enough to identify the most frequent sentence beginnings.
SELECT substring_index(sentence, ' ', 1) AS beg, count(*) AS cnt
FROM sentences
GROUP BY substring_index(sentence, ' ', 1)
ORDER BY cnt DESC
LIMIT 50;
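The same counting can be sketched outside the database. The following Python snippet is a minimal illustration, not the code used to produce the table: the function name `sentence_beginnings`, its parameters, and the toy input sentences are assumptions made for this example. It keeps everything before the N-th space of each sentence, which is what `substring_index(sentence, ' ', N)` returns, and then ranks the resulting beginnings by frequency.

```python
from collections import Counter


def sentence_beginnings(sentences, n=1, limit=50):
    """Return the `limit` most frequent n-word sentence beginnings.

    Hypothetical helper: mirrors substring_index(sentence, ' ', n)
    by keeping the text before the n-th space of each sentence.
    """
    counts = Counter()
    for sentence in sentences:
        # Splitting on a single space and rejoining the first n pieces
        # reproduces the prefix before the n-th space exactly.
        beginning = " ".join(sentence.split(" ")[:n])
        counts[beginning] += 1
    # most_common sorts by descending count, like ORDER BY cnt DESC.
    return counts.most_common(limit)


# Toy input (invented for illustration, not taken from the corpus).
toy_sentences = [
    "This is a test.",
    "This works.",
    "However, it fails.",
    "However it passes.",
]

top_one_word = sentence_beginnings(toy_sentences, n=1)
top_two_words = sentence_beginnings(toy_sentences, n=2, limit=2)
```

Because the split is on spaces only, punctuation stays attached to the word. In the toy input, `However,` and `However` are counted separately, just as the table above lists თუმცა and თუმცა, as two distinct beginnings.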
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV